MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites

نویسندگان

  • Christoph D. Schmid
  • Philipp Bucher
چکیده

Chromatin immunoprecipitation combined with massively parallel sequencing methods (ChIP-seq) is becoming the standard approach to study interactions of transcription factors (TF) with genomic sequences. At the example of public STAT1 ChIP-seq data sets, we present novel approaches for the interpretation of ChIP-seq data.We compare recently developed approaches to determine STAT1 binding sites from ChIP-seq data. Assessing the content of the established consensus sequence for STAT1 binding sites, we find that the usage of "negative control" ChIP-seq data fails to provide substantial advantages. We derive a single refined probabilistic model of STAT1 binding sequences from these ChIP-seq data. Contrary to previous claims, we find no evidence that STAT1 binds to multiple distinct motifs upon interferon-gamma stimulation in vivo. While a large majority of genomic sites with high ChIP-seq signal is associated with a nucleotide sequence resembling a STAT1 binding site, only a very small subset of the over 5 million potential STAT1 binding sites in the human genome is covered by ChIP-seq data. Furthermore a surprisingly large fraction of the ChIP-seq signal (5%) is absorbed by a small family of repetitive sequences (MER41). The observation of the binding of activated STAT1 protein to a specific repetitive element bolsters similar reports concerning p53 and other TFs, and strengthens the notion of an involvement of repeats in gene regulation. Incidentally MER41 are specific to primates, consequently, regulatory mechanisms in the IFN-STAT pathway might fundamentally differ between primates and rodents. On a methodological aspect, the presence of large numbers of nearly identical binding sites in repetitive sequences may lead to wrong conclusions about intrinsic binding preferences of TF as illustrated by the spacing analysis STAT1 tandem motifs. Therefore, ChIP-seq data should be analyzed independently within repetitive and non-repetitive sequences.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DNA binding specificity of different STAT proteins. Comparison of in vitro specificity with natural target sites.

STAT transcription factors are expressed in many cell types and bind to similar sequences. However, different STAT gene knock-outs show very distinct phenotypes. To determine whether differences between the binding specificities of STAT proteins account for these effects, we compared the sequences bound by STAT1, STAT5A, STAT5B, and STAT6. One sequence set was selected from random oligonucleoti...

متن کامل

DNA binding properties of the Arabidopsis floral development protein AINTEGUMENTA.

The Arabidopsis protein AINTEGUMENTA (ANT) is a member of a plant-specific family of transcription factors (AP2/EREBP) that share either one or two copies of an approximately 70 amino acid region called the AP2 repeat. DNA binding activity has been demonstrated previously for members of this family containing a single AP2 repeat. Using an in vitro selection procedure, the DNA binding specificit...

متن کامل

A Conserved Motif in the Linker Domain of STAT1 Transcription Factor Is Required for Both Recognition and Release from High-Affinity DNA-Binding Sites

Binding to specific palindromic sequences termed gamma-activated sites (GAS) is a hallmark of gene activation by members of the STAT (signal transducer and activator of transcription) family of cytokine-inducible transcription factors. However, the precise molecular mechanisms involved in the signal-dependent finding of target genes by STAT dimers have not yet been very well studied. In this st...

متن کامل

Crystal structure of actinomycin D bound to the CTG triplet repeat sequences linked to neurological diseases.

The potent anticancer drug actinomycin D (ActD) acts by binding to DNA GpC sequences, thereby interfering with essential biological processes including replication, transcription and topoisomerase. Certain neurological diseases are correlated with expansion of (CTG)n trinucleotide sequences, which contain many contiguous GpC sites separated by a single base pair. In order to characterize the bi...

متن کامل

A retroviral promoter and a cellular enhancer define a bipartite element which controls env ERVWE1 placental expression.

The HERV-W family contains hundreds of loci diversely expressed in several physiological and pathological contexts. A unique locus termed ERVWE1 encodes an envelope glycoprotein (syncytin) involved in hominoid placental physiology. Here we show that syncytin expression is regulated by a bipartite element consisting of a cyclic AMP (cAMP)-inducible long terminal repeat (LTR) retroviral promoter ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2010